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IDENTIFYING PEPTIDE LIGANDS OF TARGET PROTEINS WITH TARGET 
COMPLEMENTARY LIBRARY TECHNOLOGY (TCLT) 

FIELD OF THE INVENTION 
This invention relates to methods of screening for peptide ligands of target 

5 proteins. 

BACKGROUND OF THE INVENTION 
Rational drug designs are more and more dependent on identifying binding 
ligands which interacts with a target protein and the ability to predict their points of contact. 
Peptide library technology has become a very powerful tool in this regard. Large 

10 collections of peptides with randomized sequences are prepared and displayed on the 
surface of genetic packages, i.e., replicable entities such as cells or viruses, especially 
phage. Peptides having specific binding properties are subsequently selected by multiple 
rounds of panning (selection by binding affinity to an immobilized target) and amplification 
of the bound genetic package, such as phage (1). Phage display peptide libraries have been 

15 successfully screened to identify peptides that bind to cell surface or intracellular receptors, 
substrates or inhibitors of enzymes, and epitopes recognized by polyclonal and monoclonal 
antibodies (2-10). 

SUMMARY OF THE INVENTION 
This invention features an improved method of screening for peptide ligands 
20 of target proteins. This method is more powerful than the random peptide library 

technology in that it utilizes a predefined pool of peptides containing ligands for a selected 
target protein. The peptides in the pool exhibit complementary hydropathy to the target 
protein. Peptide ligands are selected based on their ability to bind to the target protein. 
This method is hereinafter referred to as Target Complementary Library Technology 
25 (TCLT). 

TCLT can be used in all occasions when random peptide libraries are used to 
identify ligands for target proteins. By identifying peptide ligands which can influence the 
biological activities of their target proteins, TCLT can be used to discover peptides of 
therapeutic utility. In addition, peptide ligands are useful for studying the dynamic structure 
30 of their target proteins, and such information can be used for designing functional 
pharmacological small molecules. 

Applications of TCLT include, but are not limited to, the following: 
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1) 



Identifying peptides which functionally mimic idiotypic antibodies. By 



mimicking the anti-idiotypic antibodies of the beta type (Ab2beta) (62), a peptide ligand 
can block the antigen-binding site of an antigen-specific antibody. By mimicking the anti- 
idiotypic antibodies of the gamma type ( Ab2gamma) (62), a peptide ligand can prevent 

5 antigen-binding of an entire class of antibodies, such as IgG or IgE. Specifically, the 

framework 2 and framework 3 (FR2 and FR3) segments of both the heavy chain and light 
chain of a immunoglobulin molecular can serve as the target sequence for binding peptides 
which mimic the function of an Ab2gamma antibody. Each of the complementarity - 
determining - regions (CDR1, CDR2 and CDR3) can serve as the target sequence for AB2 

10 beta-mimicking peptides. The target proteins include IgE, IgG, IgA, and specific antibodies 
such as anti-acetylcholine receptor antibodies. Peptide ligands of these target proteins can 
be used for the prevention and treatment of allergy, and autoimmune diseases including 
rheumatoid arthritis, myasthenia gravis, systemic lupus erythematosis, and autoimmune 
nephritis. 

15 2) Identifying peptides which mimic the function of a biological active 

molecule, such as a hormone, a cytokine, a growth factor, or a chemical compound. The 
target proteins include the cell surface receptors and intracellular receptors for these 
molecules, including, but not limited to, the receptors for insulin, human growth hormone, 
erythropoietin, thrombopoietin, estrogen, androgen, epidermal growth factor, interferons, 

20 interleukins, fibrinogen, endorphin, enkephalin, dopamine, nicotine, acetylcholine, 
angiotensin, and somatostatin. 

3) Identifying peptides which mimic the binding domain of a receptor. By 
binding to the receptor's native ligand, the peptide ligands act as decoys to compete with 
the receptor, thereby preventing the biological effects induced by the interaction between 

25 the receptor and its native ligand. The target proteins in this case include the biological 
active peptides and proteins, which include, but are not limited to, interleukin-1, 
interleukin-2, bradykinin, platelet activating factors, and tumor necrosis factor (TNF). 

4) Identifying peptides which mimic the receptor-binding domains of viral or 
bacterial proteins. Such peptides can block viral or bacterial infections by preventing the 

30 virus or bacteria from binding to its cellular receptors. In this case the target proteins can 
either be the known cellular receptors for virusestoacteria, such as ICAM-1 and LDL 
receptor which are the major and minor receptors for human rhinoviruses (63, 64), or a viral 
or bacterial surface protein known to bind to corresponding cellular receptors. 
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5) Identifying peptide ligands for any protein. The peptide ligand can either be 
used for affinity purification of the target protein, or as a probe for the detection of the 
target protein in vitro and in vivo. 

6 ) Identifying an unknown receptor for a known ligand, and an unknown ligand 
5 for a known receptor. In this case, the known ligand/receptor would serve as the target and 

the corresponding receptor/ligand can be identified by one of the following approaches. 
First, the complementary DNA fragment selected by TCLT can be used as a probe to screen 
another library which contains the potential receptor (23). Second, the complementary anti- 
sense peptide encoded by the complementary DNA fragment can serve as an immunogen to 

10 generate antibodies which interact with the potential receptor with specificity (24-37). 

TCLT involves constructing (i) a peptide library containing a collection of 
complementary peptides of a target protein/peptide, wherein the complementary peptides in 
aggregation complement all, substantially all (preferably no less than 70%, more preferable 
no less than 80%, even more preferably no less than 90%), or a significant portion of the 

15 target protein/peptide; or (ii) a nucleic acid library containing a collection of nucleic acids 
encoding the aforesaid complementary peptides. In the latter case, the nucleic acid library 
is inserted into expression vectors and introduced into genetic packages such as phage 
particles and the complementary peptides are expressed on the surface of the genetic 
packages. Peptide ligands are selected from the library by panning against the target 

20 protein/peptide or the binding domain of the target protein/peptide. The target 

protein/peptide or its binding domain is preferably immobilized on a solid support. Genetic 
packages which bind to the immobilized target protein/peptide can be amplified and subject 
to several more rounds of panning. Complementary peptides with sufficient specific 
affinity to the target protein/peptide are selected at the end of this process, and their 

25 sequences can be uncovered from the genome of the selected genetic packages. 

By "hydropathy character of an amino acid" is meant the hydrophobic or 
hydrophilic nature of the amino acid. Specifically, it is reflected by the hydropathy score of 
the amino acid in Tables 1-3 of the specification. The hydropathy scores of the twenty 
amino acids in Table 1-3 range from +4.5 to -4.5, and allow for the grouping of the twenty 

30 amino acids as hydrophobic (+4.5 to +1 .8, Table 1), hydrophilic (-4.5 to -3.2, Table 2), and 
slightly hydrophilic (-0.4 to -1.6, Table 3) (Ref. 1 1). Hydrophobic amino acids in Table 1 
(i.e., lie, Val, Leu, Phe, Cys, Met, and Ala) complement hydrophilic amino acids in Table 2 
(i.e., Arg, Lys, Asn, Asp, Gin, Glu, and His), and vice versa. Slightly hydrophilic amino 
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acids in Table 3 (i.e., Gly, Thr, Trp, Ser, Tyr, and Pro) complement each other. For 
example, each of Arg, Lys, Asn, Asp, Gin, GIu, and His is complementary to He; and each 
of Gly, Thr, Trp, Ser, Tyr, and Pro is complementary to Gly. In other words, each of Arg, 
Lys, Asn, Asp, Gin, Glu, and His is a complementary amino acid to lie; and each of Gly, 
5 Thr, Trp, Ser, Tyr, and Pro is a complementary amino acid to Gly. 

By "hydropathy profile of a peptide or protein" is meant its profile as defined 
by the hydropathic characters of amino acids in the peptide or protein. 

By "complementary peptide" is meant a peptide containing a contiguous 
amino acid sequence (in the direction from the amino terminus to the carboxy terminus or 
io in the direction from the carboxy terminus to the amino terminus) which complements the 
amino acid sequence of the target protein/peptide according to the rule set forth above. Any 
target protein/peptide has a plethora of complementary peptides because each amino acid in 
the target protein/peptide has a number of complementary amino acids. 

By "anti-sense peptide" is meant a peptide encoded by the anti-sense strand 
1 5 of the target gene translated in either 5' to 3* or 3' to 5* directions. For example, if the sense 
strand of the target gene has 5' CUU GUU CUU UUU 3' encoding a peptide Leu-Val-Leu- 
Phe, the anti-sense peptide would either be a peptide having the sequence Lys-Lys-Asn-Lys 
as encoded by the anti-sense strand 5' AAA AAG AAC AAG 3' or a peptide having the 
sequence Glu-Gln-Glu-Lys as encoded by 5'GAA CAA GAA AAA 3'. The matching 
20 pattern in Tables 1-3 shows that the anti-sense strand of a target gene usually encodes 
complementary peptide to the target protein/peptide. Anti-sense peptides which are 
complementary to the target protein/peptide are called complementary anti-sense peptides. 



containing a collection of complementary peptides of a target protein/peptide, wherein the 
25 complementary peptides in aggregation complement all, substantially all (preferably no less 
than 70%, more preferable no less than 80%, even more preferably no less than 90%), or a 
significant portion of the target protein/peptide; or (ii) a nucleic acid library containing a 
collection of nucleic acids encoding the aforesaid complementary peptides. In a preferred 
embodiment, the complementary peptides are linked to flanking sequences. 



target complementary library wherein the complementary peptides are also anti-sense 
peptides of the target protein/peptide. These anti-sense peptides in combination 
complement the amino acid sequence of the target protein/peptide in its entirety or 



By "target complementary library (TCL)" is meant (i) a peptide library 



30 



By "anti-sense target complementary library (anti-sense TCL)" is meant a 
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substantial entirety (preferably no less than 70%, more preferable no less than 80%, even 
more preferably no less than 90%). In a preferred embodiment, the anti-sense TCL is a 
gene fragment library wherein fragments of the anti-sense strand of the target protein are 
operatively linked to a promoter for expressing the anti-sense peptides. The sizes of the 
5 nucleic acid fragments encoding anti-sense peptides may be defined or random. The anti- 
sense peptides encoded by the gene fragments may overlap each other randomly or in a 
predefined manner. In a further preferred embodiment, fragments of the anti-sense strand 
are linked to flanking sequences. 



w complementary library containing or encoding a collection of complementary peptides. 
Each complementary peptide of the collection complements the entire length of the target 
protein/peptide. In addition, this collection includes all possible complementary peptides to 
the target. In other words, each amino acid in the target is complemented by all its possible 
complementary amino acids at the corresponding position respectively in different 

15 complementary peptides of the collection. For example, the cTCL of the target peptide Ile- 
Gly-Arg would contain a collection of complementary peptides represented by the formula 
X-(Arg, Lys, Asn, Asp, Gin, Glu, or His)-(Gly, Thr, Trp, Ser, Tyr, or Pro)-(Ile, Val, Leu, 
Phe, Cys, Met, or Ala)-Y, wherein each position of the complementary peptides is fully 
randomized according to the formula and X and/or Y represent either nothing or one or 

20 more amino acid residues. This cTCL would have at least 7x6x7 = 294 different members. 

By "target protein/peptide" is meant any protein/peptide or a domain within 
any protein/peptide. 

By "ligand" is meant a peptide or protein capable of binding directly to the 
target protein/peptide. 

25 B y "binding domain" or "interaction domain" is meant a domain or region of 

a protein which is primarily responsible for the protein's ability to bind to another protein. 

By "genetic packages" is meant replicable entities capable of carrying the 
nucleic acids of a target complementary library (TCL), including, but not limited to, 
vegetative bacterial cells, bacterial spores, bacterial viruses and eukaryotic cells. Genetic 

30 packages allow the amplification and recovery of the encapsulated genetic message 
following selection. A preferred genetic package is a phage. A phage display library 
expresses a collection of cloned proteins on the surface of phage particles as fusion with a 
phage coat protein. In connection with the genetic packages, "select" or "selection" means 



By "comprehensive target complementary library (cTCL)" is meant a target 
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enriching a subpopulation of genetic packages displaying a desired phenotypic 
characteristic (such as affinity to a probe). 

Other features and advantages of the invention will be apparent from the 
detailed description of the invention below and from the list of enumerated embodiments 
5 that follows. 

DETAILED DESCRIPTION OF THE INVENTION 
The molecular recognition theory and complementary hydropathy 

Without being bound by any theory, applicant's invention is supported by 
the molecular recognition theory (MRT) proposed by Blalock et al. in 1984 (11). 

io The molecular recognition theory arose from the observation that genetic 

codons for hydrophobic amino acids on the sense strand of a double-stranded DNA are 
complemented by genetic codons for hydrophilic amino acids on the anti-sense strand of the 
same ds DNA, and vice versa (1 1-13). The DNA sequences in which the amino acids 
encoded by one strand are hydropathically complemented by the amino acids encoded by 

15 the other strand are referred to as the complementary sequences. Two peptides encoded by 
complementary sequences in the same reading frame show a total interchange of their 
hydrophobic and hydrophilic amino acids when the amino terminus of one is aligned with 
the carboxy terminus of the other. The molecular recognition theory predicts that such 
inverted hydropathic pattern may allow pairs of complementary peptides to assume 

20 conformations that enable the pair to interact with each other specifically (11). 

This hypothesis was initially tested in the binding of corticotropin (ACTH) 
to its complementary anti-sense peptide (HTCA). Indeed, the synthetic peptide HTCA 
bound to ACTH in a saturable, high affinity, specific fashion (14). Later on, specific 
interactions between pairs of complementary peptides were shown by more than forty 

25 different proteins (summarized in 15). High affinity anti-sense peptides have been used in 
affinity chromatography to purify a number of biologically active proteins (16-19). 

It follows from the molecular recognition theory that antibodies against a 
complementary anti-sense peptide of a peptide ligand would bind to the receptor of this 
peptide ligand. In fact, scientists have used this approach to identify unknown receptors for 

30 known ligands, e.g., receptors for angiotensin II, Obronectin, vasopressin, p-endorphin, y- 
endorphin, and enkephalin (24-37). 

Recently, Martins et al. (36) identified the cellular receptor for human prion 
with an anti-sense peptide. They synthesized a complementary anti-sense peptide to the 
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neurotoxic region of the human prion (residues 106-126) and raised antibodies against this 
complementary anti-sense peptide. It turned out that these antibodies specifically stained 
the surface of neurons and recognized a 66-KD membrane protein which binds to prion 
(PrPc) both in vitro and in vivo. Furthermore, both the complementary anti-sense peptide 
5 and antiserum against it inhibit the toxicity of prion towards neuronal cells (36). 

Consistent with the molecular recognition theory, it has been reported in 
many instances that the complementary sense peptide of a ligand pairs with the binding 
domain of its receptor in the same way it pairs with its complementary anti-sense peptide at 
the DNA level and the protein level (20-23). In that regard, DNA probes derived from the 

io coding sequence of peptide ligands have been used successfully to identify cDNA clones of 
their receptors from a cDNA library (23). 

These examples show that the amino acid sequence of a receptor can be 
deduced from the amino acid sequence of its ligand at the region of their interaction, and 
vice versa, because these two sequences complement each other hydropathically. It has 

15 been shown that a pair of idiotype and antiidiotype antibodies (Abl and Ab2) can be 

generated by using as immunogens a pair of complementary sense and anti-sense peptides. 

In a frugal biological system developed under constant evolutionary 
pressure, a peptide or protein that does not interact with another peptide or protein is of 
little value to cells and organisms. The molecular recognition theory suggests that nature 

20 may have evolved a fail-safe system to ensure that this does not occur. Because of the 
complementarity of nucleic acids and the interactive qualities of the peptides encoded 
thereby, a strand of nucleic acid contains not only the information for one peptide but also 
that for a second peptide which is meant to interact with the first (38). It has been 
postulated that the interacting domains (encoded by the functional exons) of two proteins, 

25 such as a receptor and its ligand, co-evolved in this manner (38-40). 

In human genes, short complementary sequences (about 15 AA in length on 
average) are frequently found at intervals of approximately fifty amino acids (15, 41). The 
majority of complementary sequences are amphophilic containing reverse turns, implying 
that these regions are on the molecular surface at the beginning or end of automatic folding 

30 units, helices and beta sheets. This phenomenon implies that these regions are important 
for folding and maintaining the structural integrity of proteins (15). Therefore, 
complementary anti-sense peptides which bind to a target protein would affect the 
biological activities of the target protein. Numerous examples have illustrated that 
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biologically active proteins such as antibodies, cytokines, and hormones can be functionally 

affected by the binding of complementary anti-sense peptides (42-56). 

According to the molecular recognition theory, peptide ligands for any target 

protein could be identified among the complementary peptides of the target protein, 
5 numerous experimental observation also reveal the specific binding between peptides with 

opposite hydropathy profiles. Therefore, an efficient way of identifying ligands for a target 

is to screen a pool of complementary peptides. 

Sequences of complementary peptides to a target protein/peptide can be 

constructed by using computer programs (z.g.>see 15, 70). However, because the "native" 
1 0 hydropathy of a peptide is not only determined by its sequence, but also strongly influenced 

by the flanking regions, a better TCL is prepared by linking appropriate flanking amino 

acids to complementary peptides (15). 

A relatively simple way of constructing a TCL is to prepare a gene fragment 

anti-sense TCL. In this method, the TCL is prepared by amplifying fragments of the target 
1 5 gene by polymerase chain reaction (PCR) using random hexamer oligonucleotide primers or 

other universal primers (57). Because sequences encoded by the anti-sense DNA strand of 

the target gene is just as well represented as that of the sense strand, a complete collection 

of the anti-sense peptides with their various flanking sequences are encoded by 

approximately one half of the genetic packages so prepared. Optionally, the fraction 
20 encoding the sense strand peptides could be excluded by subtractive hybridization 

following the generation of a single stranded nucleic acid (e.g., by transcription). 

A cTCL can be synthesized as follows: residues with complementary 

hydropathy scores (as defined in Tables 1-3 and refs. 1 1-13) to each residue in the target 

sequence are randomly incorporated into the corresponding positions in the library. As a 
25 result, the TCL contains all the peptides which complement the target sequence 

hydropathically. This method is particularly suitable for screening for peptide ligands when 

the target is a short region in a protein of known sequence. 

TCL can be displayed on phage as either fusion proteins with the minor 

phage coat protein pill or the fusion proteins with the major phage coat protein pVIII. 
30 Three to five copies of pin fusion proteins are expressed per phage particle, therefore they 

are used to display peptides with good binding affinities. Because pVin fusion proteins are 

expressed at a much higher level at about 2,700 copies per particle, they can be used in a 

first screen when the binding affinities are suspected to be weak (1). 
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To select for peptides in the cTCL which have higher affinity to the target 
than the complementary anti-sense peptide, the complementary anti-sense peptide is 
synthesized and included in the eiution buffer used to elute phage particles from the 
immobilized target protein/peptide. The concentration of the complementary anti-sense 

5 peptide determines the stringency of the eiution buffer, the higher the concentration, the 
higher the stringency. Phage particles which withstand the high stringency eiution carry 
peptides with high affinity to the target protein/peptide. 

Once a binding peptide is identified through an initial screening, the 
sequence of this peptide is used as a guide for the generation of a dedicated library which 

10 contains various mutants of the initial binding peptide. Peptides with better binding affinity 
than the binding peptide from the initial screening will be selected by panning the dedicated 
library against the immobilized target protein. This procedure is described in detail in U.S, 
patent 5,223,409, incorporated by reference herein in its entirety. 

Peptides with higher affinity can also be generated by linking a peptide 

15 identified by TCLT to the polymerization domain of a coiled-coil protein (58), such as 
GCN4(59) or COMP(60), to create a tetravalent or pentavalent binding molecule. An 
increase in affinity by itf-lO 6 folds has been demonstrated with this approach (60). 

Occasionally, high-affinity ligands could not be found from libraries of 
linear and unstructured peptides because a constrained peptide with the right conformation 

20 confers more thermodynamic advantage than a peptide which changes its shape constantly 
(61). This can be remedied by introducing some structural constraints such as disulfide 
bonds to the peptides, to enhance the chance of finding a high-affinity ligand. A 
constrained TCL can be prepared by carrying PCR reaction with cytosine-containing 
random hexamer oligonucleotide primers in which either cytosine is present at various 

25 positions in every hexamer oligonucleotide primers, or one additional cytosine is added at 
the 5*- or 3'- end of the random hexamer oligonucleotide sequences. Cystein can be 
synthesized at both ends of the synthetic library. 

Example 1: Design and Construction of a Comprehensive Target Complementary 
Library fcTCU 

30 Phage fUSE5 is used as the vector for the cTCL. The library is made by 

ligation of synthetic degenerate Bgll deoxyoligonucleotides fragments into the Sfi I site of 
the fUSE5, and transfection of E. coli with the ligation products by electroporation. The 
fUSE5 vector and the cJoning procedure is described in detail in ref 71, 72 and 89. The 
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synthetic degenerate Bgll deoxyoligonucleotides fragments are synthesized fay solid-phase 
phosphoramidite chemistry (90), and carry the following sequence: 

5*-CTGTCAGGGCCCGAGGGGCT(XXX)nGGGGCCGCTGCGGCCTGTCAGG- 
3' (SEQ ID NO. 1) 

5 In the above sequence, n is the number of amino acid residues in the target 

peptide. The degenerate sequence of (XXX)n is designed by the following principle: 
hydrophobic amino acids which are complementary to the hydrophilic amino acids in the 
target peptide are randomly incorporated in the same positions in cTCL; hydrophilic amino 
acids which are complementary to the hydrophobic amino acids in the target peptide are 
io randomly incorporated in the same positions in sTCL, Slightly hydrophilic amino acids are 
used in sTCL corresponding to similar residues in the target peptide. The hydrophilic and 
hydrophobic amino acid residues are defined by their hydropathic scores as given in Tables 
1-3. 

To synthesize XXX triplets for hydrophilic amino acids, the second base 
1 5 comprises a mixture containing A and G at molar ratio 6: 1 ; the first base consists of 
equimolar mixture of A, G, and C. Thymine is not included in the mixture because its 
presence in the first position would give rise to the stop codon TAA or TAG (91). The 
third base includes only an equimolar mixture of G and C which is designed to favor 
codons used by E. coli to express its most abundant proteins (91). 

To synthesize XXX triplets for hydrophobic amino acids, the second base 
comprises a mixture containing T and C at molar ratio 5:2; the first base consists of mixture 
containing an A:T:C:G molar ratios of 3:3:3: L The third base again includes only an 
equimolar mixture of G and C. 

To synthesize XXX triplets for slightly hydrophilic amino acids, the first 
25 base consists of an equimolar mixture of A, T, C and G; the second base contains a mixture 

of G, C, and A molar ration 2:2: 1, and third base includes an equimolar mixture of G and C. 

The oligonucleotides are purified by denaturing PAGE, their complementary 

strands are synthesized by Klenow DNA polymerase (91). They are subsequently digested 

with Bgll and ligated with fUSE5 (71, 72, 89). 
30 A cTCL carrying peptides with ten amino acid residues will contain 

approximately 2.8xl0 8 independent clones, which can be completely amplified as a phage 

display library. A cTCL carrying peptides with eleven amino acids will have approximately 

2xl0 9 clones. 



20 
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Example 2: Blocking antigen binding activity of an antibody with peptide ligands 

targeting the second framework segment CFR2) in the heavy and light chain variable 
regions fVH and VL) 

Variable regions of both the heavy chain and light chain of an 
5 immunoglobulin molecule are composed of four framework segments (FR1 , FR2, FR3 and 
FR4) and three complementarity-determining segments (CDR1, CDR2 and CDR3) with the 
three CDRs separating the four FRs. The residue numbers for these segments were defined 
by Kabat et al. (80). The CDRs are the segments with highly variable amino acid sequences 
and are primarily responsible for antigenic specificity of each antibody. The FRs are the 
io relatively conserved segments which provides structural framework for the antigen-binding 
domain of an antibody. The FR2 of light chain constitutes amino acid 35 to 49, and FR2 of 
heavy chain constitutes amino acid 36 to 49 (80). 



peptide loops which represent six CDRs of both heavy and light chains {8 1). The FR2 
15 segments of each chain also directly contribute to an antigen-binding site. The FR2 
segments of heavy and light chains constitutes the side and base of the antigen-binding 
surface formed by six CDRs, and appear to be an integral part of the site. Parts of the FR2 
seem to be as exposed as the CDR3 loops. In addition, the closest contact points between 
VH and VL domains of Fab are between the FR2 of the heavy chain and the CDR3 of the 
20 light chain, and vice versa. Therefore, the FR2 plays an important role in keeping the 
structural integrity of Fab since YH and VL need to be closely apposed to each other to 
form a complete antigen-binding site. Consequently, a polypeptide (or a chemical 
compound) which specifically binds to the FR2 segment can disrupt the antigen-binding 
ability of an antibody molecule by (1) altering the topography of an antigen-binding site and 
25 thereby preventing antigen binding; and/or (2) acting as a structural hindrance to prevent 
close contact between the FR2 and the CDR3s in both chains and thereby disrupting the 
structural integrity of an antigen-binding site. 



targeting FR2 may mask antigen-binding activity of antibodies regardless of their antigenic 
30 specificities. Furthermore, peptides targeting the FR2 segments in antibodies of different 
isotypes can be used to mask antigen-binding activities of all the antibodies of the same 
isotype, i.e., IgE, IgG, IgA, IgM and IgD. 



The antigen-binding site is formed by the convergence of six hyper-variable 



Because FR2 is the most conserved segment in the variable region, a peptide 



An FR2 targeting peptide can be identified by the following approaches; 
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Synthesize a TCL based on the sequences of FR2 segments and the 



complementary hydropathy and select peptide ligands by their ability to bind to and disrupt 
the functions of antibodies with different antigenic specificities. The following examples 
illustrate how to identify anti-IgE peptides. 



complementary sequences by computer programs (15, 70). Such approach generally 
produce peptides with binding affinity within the range of lOMo 6 M' 1 . The affinity can be 
improved by screening a constrained TCL which contains various flanking sequences and 
cysteins at both sides of the complementary peptides (61). 

Example 3: Blocking Human Rhinovirus fHRV) Major Receptor (ICAM-1) with Peptide 



the cellular receptor for majority of HRVs (82). The extracellular part of the ICAM- 1 
molecule is composed by five immunoglobulin-like domains (D1-D5). Mutational analysis 
of ICAM-1 has shown that domain Dl contains the primary binding site for rhinoviruses as 
well as the binding site for its natural ligand lymphocyte function-associated antigen 1 
(LFA-1) (83-86). The regions in Dl which have been implicated as the contact sites with 
HRVs include residues 1, 2, 24-29, 40-49, and 70-77 (83-87). Accordingly, peptide ligands 
of ICAM-1 targeting these regions may prevent the binding of HRVs to ICAM-1 . Suitable 
peptide targets include, but are not limited to the following: 

Residues 1-5: QTSVS (SEQ ID NO. 2) 

Residues 24-29: SCDQPK (SEQ ID NO. 3) 

Residues 40-49: KELLLPGNNR (SEQ ID NO. 4) 

Residues 70-77: PDGQSTAK (SEQ ID NO. 5) 



identified by the following procedure: 

(1) Preparing cTCL (or constrained cTCL) for each target peptide, and displaying 
the library on phage; (2) Immobilizing each target peptide on a solid surface (Alternatively, 
the D1-D2 part of ICAM-1 molecular is expressed and purified in vitro as described in ref 
88, coated onto a solid surface, and used to screen cTCLs); (3) Selecting phage particles by 
panning as described in detail above; and (4) Recovering the peptide sequences from the 
selected phage particles. 



A very simplified version of this approach is to identify the FR2 



Ligands 



Human rhinoviruses cause about 70% of common cold. ICAM-1 serves as 



Peptide ligands capable of binding to each of the above target peptides are 
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Peptide ligands identified by screening cTCLs are synthesized and tested for 



their ability to prevent HRVs infection to Hela cells. The experimental protocols are 
described in US provisional patent application 60/067,1 19 filed December 1, 1997, 
incorporated by reference herein. 



5 



The affinity of the peptides can be improved by linking the peptides with the 



polymerization domain of a coiled-coil protein to create a multivalent binding molecule as 
described in detail in provisional patent application 60/067,119. 

Example 4: Identifying anti-IgE peptides with TCLT 
The most common form of allergic reaction is allergic rhinitis (nasal allergy) 
10 which affects about 20% of population (66), Allergic rhinitis can be either seasonal which 
indicates pollen-allergen sensitivity (also referred to as hay fever), or throughout the year 
(perennial rhinitis) which tend to be caused by house dust, dust mites, and animal danders 
(66). Allergic rhinitis is associated with irritating, sometimes disabling symptoms 
involving upper respiratory tract and eyes, it also imposes morbidity by leading to frequent 
15 secondary infections and even asthma attacks. 

At cellular level, nasal allergy is caused by inflammatory mediators released 
by the mast cells, such as histamine, serotonin, prostaglandins, and various cytokines. 
Nasal allergy is also associated with basophils which have been presensitized by IgE. 

IgE is the least component of the immunoglobulin family in human and 
20 animals. Its concentration in normal human serum is very low. Most IgE antibodies bind to 
the surface of mast cells and basophils through the Fc type 1 receptors (FcRI). Binding of 
IgE to the corresponding allergen causes clustering of IgE molecules which in turn activates 
mast ceils and basophils and results in the release of inflammatory mediators. 

Many attempts have been made to block the binding of IgE to FcRI as a way 
25 to prevent allergy. However, none of them have been successful so far in dissociating the 
preformed IgE-FcRI complex because the affinity between IgE and FcRI is as high as any 
good antibody-antigen interaction with the affinity constant measuring in the range of 
10' 10 M" 1 (67, 68). In addition, trying to prevent allergy by antibodies against other parts of 
the IgE molecule is almost bound to fail because an anti-IgE antibody can cause cross- 
30 linking of the IgE molecules on cell surface and thereby activate the mast cells and 
basophils and triggers an allergic reaction. 

One approach of preventing allergy is to mask the antigen-binding site of 
IgE with a peptide which only binds to a single IgE molecule and does not cause cross- 
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linking of the IgE molecules. Such peptides target IgE at or near the antigen binding site 
and prevent antigen binding by causing steric hindrance to the binding site. 

It would be preferable that the peptides target the conserved framework 
regions rather than the hypervariable regions in order to affect IgE molecules of a wide 
5 range of antigenic specificities. 

TCLT is suitable for finding such peptides. First, it is well-established that 
the antigen binding site in an antibody molecule (Abl) can be interfered by anti-idiotype 
antibodies (Ab2) against this antibody (Abl). One type of anti-idiotype antibody, Ab2beta, 
is the direct mirror image of the antibody binding site on Abl . Another type of anti- 
10 idiotype antibody, Ab2gamma, is directed against an idiotype near the binding site and 
causes steric hindrance to the antigen binding site (62). Second, idiotypes of an antibody 
are composed by peptide sequences found in both the hypervariable regions (CDRs) and the 
conserved framework regions of a antibody molecule (62, 69, 70). Therefore, a peptide 
targeting the conserved region of an idiotype can alter the idiotype and affect the antigen- 
15 binding ability of an entire class of antibody, such as IgE, IgG and IgA. Kang et al. have 
demonstrated the presence of complementary sequences in human immunoglobulin (52). 

Peptides affecting antigen-binding ability of human IgE molecules can be 
identified with TCLT as described in the following examples. 

Example 5: Blocking antigen-binding ability of an antibody molecule by targeting the 
20 framework 2 TFR2) region of the antibody 

Based on their sequences, the heavy chain (H) of an antibody can be classified into 
six families (V H 1 to V H 6). The general sequences of FR2 region in each family are in the 
following: 



V L kappa: W YQQKPGQ/KP/S/APKLLI Y(SEQ ID NO. 12) 
Human IgE molecules are mostly composed of heavy chain genes from the V H 5, 
V H 3, V H 4 and V H 6 families (92,93) T therefore the FR2 sequences from these families plus 



25 



30 



V H 1: W V R/Q Q A P/H/T G/A K/Q G/E/R/A L E/G W M/I G (SEQ ID NO. 6) 

V H 2: WIRQPPGKALEWLA (SEQ ID NO. 7) 

V H 3: W V/I R/H Q A P/Q G K G UP E/V W/Y/L V S/A/G (SEQ ID NO. 8) 

V H 4: WI/VRQPPGKGLEWIG (SEQ ID NO. 9) 

V H 5: WVRQMPGK G/E LEWMG (SEQ ID NO. 10) 

V H 6: WIRQSPSRGLEWLG (SEQ ID NO. 11) 
The sequence of the light chain (L) of an antibody is: 
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the FR2 from the light chain serve as the target sequences for an IgE blocking polypeptide 
in this invention. 

The IgE-blocking peptide can be designed directly based on the molecular 
recognition theory as in the following: 
5 Peptide 1 (binds to FR2 in V H 5): PDALHGPFA Q(or D)LPHP (SEQ ID 

NO. 13) 

Peptide 2(binds to FR2 in V H 3, V H 4 and V H 6): PDALG/RGPFAQ/DLPNP 
(SEQ ID NO. 14) 

Peptide 3(binds to FR2 in V L kappa chain): PVLLFRPLRGFEEDI (SEQ ID 
10 NO, 15) 

To isolate peptides with higher affinity to the FR2 region of IgE, a human IgE anti- 
sense TCL or comprehensive TCL library can be screen using the above FR2 target 
sequences. 

An alternative way to generate human IgE blocking polypeptide which targets the 
15 FR2 region is to screen from a phage display human antibody library. In this case, each 
peptide encoded by the targeted FR2 sequence is synthesized artificially and used for 
panning as described below. 

Example 6: Constructio n of a human IgE anti-sense TCL on bacteria phage 

The human IgE anti-sense TCL is prepared by using coding sequences of the 

20 cDNA clones of human IgE light chain and heavy chain. The cDN As of human IgE are 
separated from the vector sequences by agarose gel electrophoresis. Equal amounts of each 
purified inserts are mixed and used as templates for a random priming reaction. 

Approximately 50 ng of the mixed template is boiled with 1 pmoi of the 
primer No. 1 (5'-GACGTGGCOs{>-3\ N can be A, T. C, or G) for 3 min, cooled on ice, 

25 mixed with a reaction mixture containing 10 mM Tris.HCl (pH 7.5), 10 mM MgC12, 1 mM 
dithiothreitol, 0.125 mM each dNTP, and 2.5 units of Klenow fragment of DNA 
polymerase and incubated at 37 °C for 15 min. Then the reaction mixture is boiled and 
cooled again, another 2.5 units of Klenow fragment is added, and the reaction mixture is 
incubated at 37 °C for 15 min. 

30 The taction is stopped by boiling and diluted 1:10 with TE buffer and 

products are separated from primers using Microcon-100 microconcentrators (Amicon) 
with two washes of the retained solution with 200 ml of the TE buffer. After two cycles of 
priming, some reaction products will incorporate primer sequences at both ends. They are 
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amplified in PCR using primer No. 2 ( S'-GGCCGACGTGGCC-O') (SEQ ID NO. 16). The 
amplified products are precipitated, purified with Microcon-100, cut with sfil and cloned 
into the sfil site of the fUSE5 vector (71, 72). Amplification of the IgE-TCL follows the 
methods described by Smith and Scott (71), the details which are given in the manual 
5 supplied with the fUSE expression kit by Smith (72). 

Primer No.l: 5 * -G ACGTGGCCTGTN6-3 * (SEQ ID NO. 17) and primer No. 
2: 5 T -GGCCGACGTGGCCTGT-3 ' (SEQ ID NO. 18) are used to generate a constrained 
IgE-TCL. 

Example 7: Selection of the IgE binding complementary anti-sense peptide by panning 

10 the human IgE TCL 

Human IgE is diluted in PBS to a concentration of 20 mg/ml and is used to 
coat 3.5 cm wells by incubating for 1 h at 4°C. The remaining binding sites are saturated 
by bovine serum albumin (BS A). A portion of the amplified IgE-TCL is first incubated for 
2 h at 4 °C in a 3.5 cm well precoated with 1 mg/ml BSA in PBS and 1 mM MnCLL The 

15 phage unbound to BSA are transferred to a similar well precoated with human IgE. After 
incubation for 1 h at 4 °C, the unbound phage are removed by washing 10 times with PBS 
buffer containing 0.5% Tween 20. The bound phage are eluted with 0. 1 M glycine buffer, 
pH 2.2, containing 1 mg/ml BSA and 0. 1 mg/ml phenol red. 

The phages are amplified using the K91kan bacteria and partially purified by 

20 precipitation with polyethylene glycol (72). The panning is repeated for two more rounds. 
Sequences carried by the selected phage are then determined using the Sequenase kit 
(United States Biochemical) with the primer 5 ' -CCCTC AT AGTT A AG CGT A ACG-3 ' (73) 
(SEQ ID NO. 19). 

Example 8: Characterizing anti-IgE activities of the complementary anti-sense peptides 
25 The peptides selected from the previous step are synthesized and 

individually tested in the following assays. 

Assay 1 : Histamine release by passively sensitized basophils. 

The procedure is described in detail previously (74). Briefly, reaginic sera 
are obtained from allergic patients who are allergic to ragweed, rye grass, Chinese elm or 
30 other preidentified allergens. Peripheral blood mononuclear cells are isolated from normal 
individuals by differential centrifugation on Ficoll-Pague (Pharmacia). Aiiquots of a cell 
suspension are incubated with 2- to 10-fold dilution of a reaginic serum, Tris A buffer 
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containing 0.025 M Tris, 0.12 M NaCL 0.005 M KC1, and 0.05% human serum albumin 
(pH 7.5) at 37 °C for 90 min. 

The cells are then washed three times with Tris A buffer and resuspended in 
Tris- ACM buffer (pH 7.6) which consists of Tris A buffer plus 0.6 mM CaC12 and 0.01 
5 mM MgC12. They are incubated with peptides at various concentrations for 30 min at 37 
°C and then are challenged with either purified ragweed antigen E (0.1 mg/ml) or crude 
ragweed antigen, crude Chinese elm extracts, and rye grass extracts (Hollister-Stier, 
Spokane, WA). 

After 40-min incubation at 37 °C, histamine content in supernatants is 
10 measured by the method of Siraganian (75). The peptides which can prevent histamine 
release triggered by ail allergens will be selected. 

Assay 2: Prausnitz-KDstner reactions 

The procedure is carried out as described previously (74, 75). The reactions 
are done in a single healthy individual whose serum IgE concentration is at low level 
15 (around 10 ng/ml). To determine the abilities to block passive sensitization, 20 ml of 

various concentrations of the peptides is injected intracutaneously into skin sites 1 hr before 
sensitization of the skin sites with 20 ml of a 50-fold dilution of allergic serum (IgE at 9 12 
ng/ml). 

In each experiment, control site receives only the diluting buffer before the 
20 sensitization. All skin sites are challenged 48 hr after the passive sensitization with allergen 
extract, such as crude ragweed antigen. The size of wheat and erythema reactions are 
measured 20 min later by using transparent tape to transfer the outline of the reactions to a 
paper and expressed by the weight of the paper (76). It has been demonstrated that 
unsensitized skin sites do not show any wheat or flare reaction upon antigen challenge. 
25 Peptides which can prevent or lessen the reactions will be selected. 

Example 9: Measuring affinity of the IgE-binding peptides and improvement of the 
peptides 

The binding of peptides to human IgE can be analyzed by BIAcore biosensor 
(Pharmacia Biosensor AB, Uppsala, Sweden) (77, see 78 for review). Experiments are 
30 carried out at 25 °C in 10 mM HEPES buffered saline (HBS) with 150 mM NaCl, 3.4 mM 
EDTA and 0.005% Surfactant P20 (Pharmacia). 

Human IgE is coupled to the sensor chip by activating the 
carboxymethylated dextran matrix with 0.2 M N-ethyl-N'-fS-diethylaminopropyl)- 

- 17- 



WO 99/5591 1 %W PCTAJS99/06537 

carbodiimide and 0.05 M N-hydroxysuccinirnide (NHS) (Pharmacia Amine Coupling Kit), 
followed by addition of the human IgE (80 mg/ml, 10 mM sodium acetate, pH 4.5). 
Remaining NHS-esters are blocked with a pulse of 1.0 M ethanolamine hydrochloride, pH 

8.5. 

5 To monitor the binding of peptides to IgE, various concentrations of peptide 

are injected over the immobilized IgE using conditions where binding is limited by kinetic 
parameters. Under these conditions, the solution association constant (KA) for the IgE- 
peptide interaction can be calculated using the following equation: 

[SL]=KA[S]t[L]/(H-KA[L]) (79) 

10 where [SL] is the concentration of the IgE-peptide complex at equilibrium, [L] is the 
concentration of the free peptide, and [S]t is the total concentration of IgE. 

Because the peptides selected in the present invention prevent the binding of 
allergens to IgE by steric hindrance rather than directly competing with the allergens for 
binding sites on IgE, peptides with apparent affinity comparable or even lower than the 

15 affinity between IgE and allergens can be effective in preventing the binding of allergens as 
long as enough concentration of the peptides is maintained locally so that they remain 
bound to IgE when the presensitized mast cells or basophils encounter the allergens. 
Nevertheless, if peptides with higher affinity are desired, an in vitro affinity maturation 
process can be performed using the TCLT-selected peptides as templates. The details of 

20 this approach are described in U.S. patent 5,223,409, incorporated by reference herein in its 
entirety. 

The human IgE FR2 segment binding peptides can also be selected by 
constructing a cTCL against the IgE FR2 segment, and screening the cTCL with a synthetic 
peptide containing the IgE FR2 sequence. The screening procedure and characterization of 
25 the selected peptides are carried out as described above. 
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What is claimed is : 

1 . A method of screening for peptide ligands of a target protein, comprising the 
steps of: 



5 peptides of said target protein, wherein said complementary peptides in aggregation 
complement substantially the entire length of the said target protein; 
expressing said peptides from said recombinant nucleic acids; 
bringing said target protein into contact with said peptides expressed from said 
recombinant nucleic acids; and 
10 selecting one or more peptides expressed from said recombinant nucleic acids which 

bind to said target protein as peptide ligand or ligands of said target protein. 

2. The method of claim 1 , wherein said collection of recombinant nucleic acids 
are introduced into a plurality of genetic packages for expressing peptides encoded by said 

15 recombinant nucleic acids; said target protein is brought into contact with said genetic 

packages for selecting one or more peptides expressed from said recombinant nucleic acids 
with the highest affinity to said target protein. 

3. The method of claim 2, wherein said genetic packages are selected from the 
20 group consisting of vegetative bacterial cells, bacterial spores, bacterial viruses and 

eukaryotic cells. 

4. The method of claim 2, wherein said genetic packages are bacteriophages 
and said peptides are expressed on the surface of the particles of said bacteriophage. 



5. The method of claim 1, wherein said complementary peptides are anti-sense 
peptides to said target protein. 

6. The method of claim 5, wherein said collection of recombinant nucleic acids 
30 is prepared by generating random fragments from a nucleic acid encoding said target 

protein and inserting said fragments into expression vectors. 



providing a collection of recombinant nucleic acids encoding complementary 



25 
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What is claimed is: 



1. 



A method of screening for peptide ligands of a target protein, comprising the 



steps of: 

providing a collection of recombinant nucleic acids encoding complementary 
peptides of said target protein, wherein said complementary peptides in aggregation 



complement substantially the entire length of the said target protein; 

expressing said peptides from said recombinant nucleic acids; 

bringing said target protein into contact with said peptides expressed from said 
recombinant nucleic acids; and 

selecting one or more peptides expressed from said recombinant nucleic acids which 
bind to said target protein as peptide ligand or ligands of said target protein. 

2. The method of claim 1, wherein said collection of recombinant nucleic acids 
are introduced into a plurality of genetic packages for expressing peptides encoded by said 
recombinant nucleic acids; said target protein is brought into contact with said genetic 
packages for selecting one or more peptides expressed from said recombinant nucleic acids 
with the highest affinity to said target protein. 

3. The method of claim 2, wherein said genetic packages are selected from the 
group consisting of vegetative bacterial cells, bacterial spores, bacterial viruses and 
eukaryotic cells. 

4. The method of claim 2, wherein said genetic packages are bacteriophages 
and said peptides are expressed on the surface of the particles of said bacteriophage. 

5. The method of claim 1, wherein said complementary peptides are anti-sense 
peptides to said target protein. 

6. The method of claim 5, wherein said collection of recombinant nucleic acids 
is prepared by generating random fragments from a nucleic acid encoding said target 
protein and inserting said fragments into expression vectors. 
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7. 



The method of claim 6, wherein said random fragments are generated by 



amplifying said nucleic acid encoding said target protein with random hexamer 
oligonucleotide primers. 

8. The method of claim 1 , wherein said target protein is selected from the 
group consisting of IgE, IgA, IgG, IgD, IgM, ICAM-1 and LDL receptor. 

9. The method of claim 1, wherein said target protein is a framework 2 segment 
from a heavy chain or light chain of an immunoglobulin molecule. 

10. A method of treating allergy and autoimmune diseases, comprising the usage of 
polypeptides or chemical compounds which bind to the framework 2 region of the human 
immunoglobulin. 

1L The method of claim 10, wherein said allergy comprises allergic rhinitis and 

asthma. 

12. The method of claim 10, wherein said autoimmune diseases comprises 
rheumatoid arthritis, myasthenia gravis, systemic lupus erythematosis, and autoimmune 
nephritis. 

13. The method of claim 10, wherein said polypeptide is a complementary peptide 
to the framework 2 region. 

14. The method of claim 10, wherein said polypeptide is an antibody against the 
framework 2 region. 



15. The method of claim 10, wherein said human immunoglobulin comprises IgG, 
IgA, IgE, IgM and IgD. 
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(1} GENERAL INFORMATION: 

5 (i) APPLICANT: Fang Fang 

£ii) TITLE OF INVENTION: IDENTIFYING PEPTIDE LIGANDS OF TARGET 
PROTEINS WITH TARGET COMPLEMENTARY LIBRARY TECHNOLOGY (TCLT) 

10 (iii) NUMBER OF SEQUENCES: 18 

(iv) CORRESPONDENCE ADDRESS: 

I A) ADDRESSEE: Lucy X. Yang 
(B) STREET: 3718 Winding Way 
15 (C) CITY: Kalamazoo 

(D) STATE: MI 

(E) COUNTRY: USA 

(F) ZIP: 49004 

20 (v) COMPUTER READABLE FORM; 

IBM, Microsoft Word 6.0 

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 
25 (B) FILING DATE: 

<C) CLASSIFICATION: 

(viii) ATTORNEY / AGENT INFORMATION: 
(A) NAME: Lucy X. Yang 
30 (B) REGISTRATION NUMBER: 40,259 

(ix) TELECOMMUNICATION INFORMATION: 
(A) TELEPHONE: 616-385-5824 
{B) TELEFAX: 616-385-5824 



35 



55 



(2) INFORMATION FOR SEQ ID N0:1: 



{i) SEQUENCE CHARACTERISTICS: 
40 (A) LENGTH: 45 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(E) FEATURE: X residue designated in (XXX) n can be A, C, T, U, 
45 or G; n can be any integer. 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: NO 
50 (iv) ANTI-SENSE: NO 

(Xi) SEQUENCE DESCRIPTION : SEQ ID NO : 1 : 

ctgtcagggc ccgaggggct (XXX)nggggccg ctgcggcctg tcagg 45 



(2) INFORMATION FOR SEQ ID NO:2: 



fi) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 5 amino acids 

(B) TYPE: amino acids 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

65 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 



25 
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(iv) ANTI- SENSE; NO 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

5 

Glu Thr Ser Val Ser 
1 5 

10 (2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acides 

(B) TYPE: amino acids 

15 (C) STRANDEDNESS : single 

(D) TOPOLOGY; linear 

(ii) MOLECULE TYPE: peptide 

20 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 



Ser Cys Asp Glu Pro Lys 
1 b 



30 (2) INFORMATION FOR SEQ ID NO: 4: 

(ij SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acids 

35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

40 <iii) HYPOTHETICAL: NO 

(iv) ANTI - SENSE : NO 

(Xi) SEQUENCE DESCRIPTION; SEQ ID NO:4; 



Lys Gin Leu Leu Leu Pro Gly Asn Asn Arg 
5 10 



50 (2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acids 

55 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

60 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION; SEQ ID NO: 5: 



Pro Asp Gly Glu Ser Thr Ala Lys 
1 5 



■2- 



40 



45 



WO 99/55911 



PCT/US99/06537 



(2) INFORMATION FOR SEQ ID NO: 6: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 (E) FEATURES: Xaa at position 3 can be Arg or Glu; Xaa at 

position 6 can be Pro, His or Thr; Xaa at position 7 can be Gly or Ala; 
Xaa at position 8 can be Lys or Glu; Xaa at position 9 can be Gly, Gin, 
Arg or Ala; Xaa at position 11 can be Gin or Gly; Xaa at position 13 can 
be Met or He. 

15 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

20 (iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO; 6: 

Trp Val Xaa Glu Ala Xaa Xaa Xaa Xaa Leu X Trp Xaa Gly 
25 1 5 10 

(2) INFORMATION FOR SEQ ID NO: 7: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
35 <ii> MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 



Trp He Arg Glu Pro Pro Gly Lys Ala Leu Gin Trp Leu Ala 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
50 (A) LENGTH: 14 amino acids 

<B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(E) FEATURES: Xaa at position 2 can be Val or He; Xaa at 

55 position 3 can be Arg or His; Xaa at position 6 can be Pro or Glu; Xaa at 
position 10 can be Leu or Pro; Xaa at position 11 can be Gin or Val; Xaa 
at position 12 can be Trp, Tyr or Leu; Xaa at position 14 can be Ser, Ala 
or Gly. 

60 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

65 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8 : 



Trp Xaa Xaa Glu Ala Xaa Gly Lys Gly Xaa Xaa Xaa Val Xaa 

-3- 
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(2) INFORMATION FOR SEQ ID NO: 9: 

5 

<i> SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 
10 (D) TOPOLOGY: linear 

(E) FEATURES: Xaa at position 2 can be lie or Val . 

(ii) MOLECULE TYPE: peptide 
15 (iii) HYPOTHETICAL: NO 

(iv) ANTI- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 



20 



Trp Xaa Arg Qlu Pro Pro Gly Lys Gly Leu Gin Trp lie Gly 
15 10 



25 (2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

30 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(E) FEATURES: Xaa at position 9 can be Gly or Gin. 
(ii) MOLECULE TYPE: peptide 

35 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Trp Val Arg Glu Met Pro Gly Lys Xaa Leu Gin Trp Met Gly 
15 10 

45 

(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 
50 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: peptide 

55 (iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

60 

Trp lie Arg Glu Ser Pro Ser Arg Gly Leu Gin Trp Leu Gly 
15 10 



65 (2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 
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10 



15 



20 



35 



(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(E) FEATURES: Xaa at position 8 can be Glu or Lys ; Xaa at y 
position can be Pro, Ser or Ala 

£ii) MOLECULE TYPE : peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Tro Tyr Glu Glu Lys Pro Gly Xaa Xaa Pro Lys Leu Ley lie Tyr 
- 5 10 15 



1 



(2) INFORMATION FOR SEQ ID NO: 13: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
25 (D) TOPOLOGY: linear 

(E) FEATURES: Xaa at position 10 can be Glu or Asp 

(ii) MOLECULE TYPE: peptide 
30 (iii) HYPOTHETICAL: NO 

{iv) ANTI -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 



Pro Asp Ala Leu His Gly Pro Phe Ala Xaa Leu Pro His Pro 
1 5 10 



40 (2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

45 (C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(E) FEATURES: Xaa at position 5 can be Gly or Arg; Xaaa 
position 10 can be Glu or Asp 

50 (ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

55 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Pro Asp Ala Leu Xaa Gly Pro Phe Ala Xaa Leu Pro Asn Pro 
1 5 10 

60 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
65 (A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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10 



15 



20 



(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI- SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
Pro val Leu Leu Phe Arg Pro Leu Arg Gly Phe Gin Gin Asp lie 



1 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



25 



(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

13 

30 ggccgacgtg gcc 

(2) INFORMATION FOR SEQ ID NO: 17: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

40 

(ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: NO 
45 (iv) ANTI -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO : 17 : 



50 



gacgtggcct gt 
(2) INFORMATION FOR SEQ ID NO: 18: 



(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

60 (ii) MOLECULE TYPE: DNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

65 (x i) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 



-6- 



WO 99/55911 




PCT/US99/06537 



16 



ggccgacgtg gcctgt 

(2) INFORMATION FOR SEQ ID NO : 19 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

( c ) STRANDEDNESS : s ingle 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

{iii) HYPOTHETICAL: NO 

(iv) ANT I -SENSE : NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 



ccctcatagt taagcgtaac g 



21 



# 



Please type a plus sign (+) inside this box — > 1 +1 . PTO/SB/01 (1 2-97) 

Approved for use through 9/30/00. OMB 0651-0032 
Patent and Trademark Office; U.S. DEPARTMENT OF COMMERCE 
Under the Paperwork Reduction Act of 1995, no persons are required to respond to a collection of information unless it contains 
a valid OMB control number. 



+ 



DECLARATION FOR UTILITY OR 
DESIGN 
PATENT APPLICATION 
(37 CFR1.63) 



□ Declaration 
Submitted 
with Initial 
Filing 



IS Declaration 
OR Submitted after Initial 
Filing (surcharge 
(37 CFR 1.16(e)) 
required) 



Attorney Docket Number 



First Named Inventor 



019815-000200US 



Fang Fang 



COMPLETE IF KNOWN 



Application Number 



Filing Date 



Group Art Unit 



Examiner Name 



09 /674,014 



As a below named inventor, I hereby declare that: 

My residence, post office address, and citizenship are as stated below next to my name. 

I believe I am the original, first and sole inventor (if only one name is listed below) or an original, first and joint inventor (if plural 
names are listed below) of the subject matter which is claimed and for which a patent is sought on the invention entitled: 



IDENTIFYING PEPTIDE LIGANDS OF TARGET PROTEINS WITH TARGET 
COMPLEMENTARY LIBRARY TECHNOLOGY (TCLT) 



the specification of which 

□ 



(Title of the Invention) 



0 



is attached hereto 
OR 

was filed on (MM/DD/YYYY) 



April 19, 1999 



Application Number |pCT/US99/0653l and was amended on (MM/DD/YYYY) 



as United States Application Number or PCT International 

| (if applicable). 



I hereby state that I have reviewed and understand the contents of the above identified specification, including the claims, as 
amended by any amendment specifically referred to above. 

I acknowledge the duty to disclose information which is material to patentability as defined in 37 CFR 1.56. 



I hereby claim foreign priority benefits under 35 U.S.C. 119(a)-(d) or 365(b) of any foreign appiication(s) for patent or inventor's 
certificate, or 365(a) of any PCT international application which designated at least one country other than the United States of 
America, listed below and have also identified below, by checking the box, any foreign application for patent or inventor's certificate, 
or of any PCT international application having a filing date before that of the application on which priority is claimed. 



Prior Foreign Application 
Number(s) 



Country 



Foreign Filing Date 
(MM/DD/YYYY) 



Priority 
Not Claimed 



Certified Copy Attached? 
YES NO 



□ 
□ 
□ 
□ 



□ 
□ 
□ 
□ 



□ 
□ 
□ 
□ 



□ Additional foreign application numbers are listed on a supplemental priority data sheet PTO/SB/02B attached hereto: 
I hereby claim the benefit under 35 U.S.C. 119(e) of any United States provisional application(s) listed below. 



Application Number(s) 



60/083,046 



Filing Date (MM/DD/YYYY) 



April 24, 1998 



| | Additional provisional application 
numbers are listed on a 
supplemental priority data sheet 
PTO/SB/02B attached hereto. 



+ 
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Burden Hour Statement: This form is estimated to take 0.4 hours to complete. Time will vary depending upon the needs of the 
individual case. Any comments on the amount of time you are required to complete this form should be sent to the Chief Information 
Officer, Patent and Trademark Office, Washington, DC 20231. DO NOT SEND FEES OR COMPLETED FORMS TO THIS 
ADDRESS. SEND TO: Assistant Commissioner for Patents, Washington, DC 20231 . 



II 



Please type a plus sign (+) inside this box 



PTO/S8/01 (12-97) 
Approved for use through 9/30/00. OMB 0651-0032 
Patent and Trademark Office; U.S. DEPARTMENT OF COMMERCE 
Under the Paperwork Reduction Act of 1995, no persons are required to respond to a collection of information unless it contains 
a valid OMB control number. 



DECLARATION — Utility or Design Patent Application 



hereby claim the benefit under 35 U.S.C. 120 of any United States appiication(s), or 365(c) of any PCT international application designating the 
United States of America, listed below and, insofar as the subject matter of each of the claims of this application is not disclosed in the prior 
United States or PCT International application in the manner provided by the first paragraph of 35 U.S.C. 112, 1 acknowledge the duty to disclose 
information which is material to patentability as defined in 37 CFR 1 .56 which became available between the filing date of the prior application 
and the national or PCT international filing date of this application. ^ 



U.S. Parent Application or PCT Parent 
Number 



Parent Filing Date 
(MM/DD/YYYY) 



Parent Patent Number 
(if applicable) 



□ Additional U.S. or PCT international application numbers are listed on a supplemental priority data sheet PTO/SB/028 attached hereto, 



As a named inventor, I hereby appoint the following registered practi tioners) to prosecute this app lication and to tra nsact ail business in the Paten t 
and Trademark Office connected therewith: r— J Customer Number | 

OR 



HQ Registered practitioners) name/registration number listed below 



Plkca Customer 
Number Bar Code 



Name 



Registration 
Number 



Name 



Registration 
Number 



Kenneth A. Weber 



3U£Z2_ 



Zl Additional registered practitioner(s) named on supplemental Registered Practitioner Information sheet FTO/SB/02C attached hereto. 



Direct all correspondence to: □ Customer Number 

or Bar Code Label 



Off HD Correspondence address below 



Name 



Kenneth A. Weber 



Address 



Townsend and Townsend and Crew LLP. 



Address 



Two Embarcadero Center, 8th fl 



City 



San Francisco 



State 



ZIP 



Country 



US 



Telephone) 615-576-07.00 



I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and belief are 
believed to be true; and further that these statements were made with the knowledge that wtWful false statements and the like so made are 
punishable by fine or imprisonment, or both, under 18 U.S.C. TG01 and that such willful false statements may jeopardize the validity of the 
application or any patent issued thereon. 



Name of Sole or First Inventor: 



D A petition has been filed for this unsigned inventor 



Given Name (first and middle fit anvfl 



Family Nams or Surname 



Inventor 1 * 
Signature 




Date 



Residence: City 



San_D4e^o 



State! CA Country 



US 



Citizenship 1 CN 



Post Office Address 



7155-B Calabria Court 



Post Office Address 



City 



San Dieg (State 



CA 



ZIP 1 991?? 



Country 



D Additional inventors are being named on the supplemental Additional Inventof(s) sheet(s) PTO/SB/02A attached hereto 
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